Acquisition of Stand-up Behavior by a Real Robot using Hierarchical Reinforcement Learning

نویسندگان

  • Jun Morimoto
  • Kenji Doya
چکیده

In this paper, we propose a hierarchical reinforcement learning architecture for a robot with large degrees of freedom. In order to enable learning in a practical numbers of trials, we introduce a low-dimensional representation of the state of the robot for higher-level planning. The upper level learns a discrete sequence of sub-goals in a low-dimensional state space for achieving the main goal of the task. The lower-level modules learn local trajectories in the original high-dimensional state space to achieve the sub-goal speci ed by the upper level. We applied the hierarchical architecture to a three-link, two-joint robot for a task of learning to stand up by trial and error. The upper-level learning was implemented by Q learning, while the lowerlevel learning was implemented by a continuous actor-critic method. The robot successfully learned to stand up within 750 trials in simulation and then in an additional 170 trials using real hardware.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Reinforcement learning of dynamic motor sequence: learning to stand up

I n this paper, we propose a learning method f o r implementing human-like sequential movements in robots. As an example of dynamic sequential movement, we consider the “stand-up” task f o r a two-joint, three-link robot. In contrast t o the case of steady walking or standing, the desired trajectory fo r such a transient behavior is very dificult t o derive. The goal of the task is to find a pa...

متن کامل

Hierarchical Reinforcement Learning of Low-Dimensional Subgoals and High-Dimensional Trajectories

In this paper, we propose a hierarchical reinforcement learning method which enables a learner to learn tasks in a highdimensional state space. In the upper level, the learner coarsely explores the low-dimensional state space. In the lower level, the learner nely explores the high-dimensional state space. Speci cally, the learner learns to set up appropriate subgoals for the task in the upper l...

متن کامل

Behavior Acquisition in RoboCup Middle Size League Domain

The RoboCup middle size league is one of the leagues that have the longest histories in RoboCup. This league has unique features, for example, bigger robots (around 45cm square) plays on the largest field (say, 18m×12m in 2007), any global sensory system is not allowed to use, all robots have on-board vision systems and controllers. Each robot plays based on its own sensory information, and it ...

متن کامل

Accelerating flat reinforcement learning on a robot by using subgoals in a hierarchical framework

Learning a motor skill task with Reinforcement Learning still takes a long time. A way to speed up the learning process without using much prior knowledge is to use subgoals. In this study, the use of subgoals decreased the learning time by a factor nine and we show that tests on a real robot give similar results. The price to be paid, in case the subgoals do not lie on the optimal path, is a w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Robotics and Autonomous Systems

دوره 36  شماره 

صفحات  -

تاریخ انتشار 2000